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Abstract. We are now witnessing a rapid growth of a new part of group the- 
ory which has become known as "statistical group theory" . A typical result in 
this area would say something like "a random element (or a tuple of elements) 
of a group G has a property P with probability p" . The validity of a statement 
like that does, of course, heavily depend on how one defines probability on 
groups, or, equivalently, how one measures sets in a group (in particular, in a 
free group). We hope that new approaches to defining probabilities on groups 
outlined in this paper create, among other things, an appropriate framework 
for the study of the "average case" complexity of algorithms on groups. 
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1. Introduction 



A new part of group theory, often called "statistical group theory" , is becoming 
increasingly popular since it connects group theory to other areas of science, most 
of all to statistics and to theoretical computer science. 

A typical result in this area would say something like "a random element (or a 
tuple of elements) of a group G has a property P with probability p" (see e.g. iQ, 
1^, ||18|). The validity of a statement like that does, of course, heavily depend on 
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how one defines probability on groups, or, equivalently, how one measures sets in 
a group. This is the problem that we address in the present paper; we feel that it 
deserves some discussion at a general methodological level. A much more technical 
development of some of the ideas can be found in |^ . 

Our approach to statistical group theory is formed by needs of practical com- 
putations with infinite groups. In particular, the starting point of our study of 
measures on groups was the desire to identify measures which can be used in the 



analysis of the behaviour of genetic algorithms on infinite groups [15, 16, 22|. 
Hence one of our main requirements for a measure is that it reflects the nature of 
algorithms used for generating random or pseudo-random elements of a group. In 
Section |^ we discuss some other natural conditions which would make a measure 
suitable for practical computations. 

Since we focus mostly on finitely generated discrete groups, almost all measures 
defined in the present paper are atomic. Recall that a probabilistic measure on a 
countable set is atomic if every subset is measurable. Clearly, the latter condition 
is natural in the context of computational group theory where we are restricted 
to dealing with finite sets of elements. In Section ^, we suggest a simple (perhaps 
even naive) general approach to constructing atomic measures on countable groups 
associated with various length (or "complexity") functions. 

In Sections ^, H, we look at the problem of measuring sets in infinite groups 
when pseudorandom elements of a group are generated by a deterministic pro- 
cess. This naturally leads to Kolmogorov complexity of words as a more adequate 
concept of the "length function" on a free group. The invariance theorems for 
Kolmogorov complexity provide for a uniform treatment of different ways to define 
computational complexity functions on groups. 

Section |^ deals with the so-called short elements bias which occurs in any gen- 
erator of random elements of a group G with a given atomic probability distribution 
fj,. This effect is based on a simple observation that the measure IJ,{S) of an infinite 
set S' C G is essentially defined by a few short elements from S. There are several 
principal approaches to this problem. The most popular one suggests to consider 
relative frequencies 

\snBk\ 

(where Bk is the ball of radius k in the Cayley graph of G with respect to a given 
set of generators) and their behaviour at "infinity" when fc ^ oo. This leads to the 
notion of asymptotic density of S defined as the following limit (if it exists): 

\snBk\ 

'^^^^.'t^^bt- 

Another way to avoid the short elements bias is to replace a single measure fi on 
G by a parametric family of distributions {(J-l} in which L is the average length of 
elements of G relative to fi^ ■ In this case the asymptotic behaviour of the set S is 
described by the limit 

^00(5") = lim ^l(S'). 

L — >oc 

We discuss asymptotic densities in the last section and refer to |Q for a detailed 
discussion of the second method. 

In Section ^ we show how a specific choice of the probability distribution on a 
free group allows one to introduce degrees of polynomial growth "on average" for 
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functions on groups. In particular, it is a possible way for introducing hierarchies 
of the average case complexity of various algorithms for infinite groups, making 
meaningful statements like "the algorithm works in cubic time on average" . 

Section ^ discusses probability measures generated by random walks on the 
Cay ley graph of a free group F„. This class of measures is well suited for analysis 
of randomized algorithms on groups and behaves nicely with respect to taking finite 
factor groups. 

In Section we analyze some limitations of the asymptotic density as a tool for 
measuring sets in infinite groups. The most important one is that it is not sensitive 
enough, i.e., many sets of intuitively different sizes have the same asymptotic density 
(usually 1 or 0). Indeed, in p^ , Woess proved that every normal subgroup of 
infinite index in a free group Fn, n ^ 2, has asymptotic density 0. 



Another disappointment is offered by Theorem 10.4 which states that the set 
of primitive elements of a free group -F^, n ^ 2, has asymptotic density 0. In fact, 
our proof provides lower and upper bounds for the relative frequencies of the set of 
primitive elements of F^, which is a result of independent interest. 

One way around this problem is to consider the so-called growth rate of the 
relative frequency defined by 



7(5)=limsup -tVM- 

k — >oo 

This is a useful characteristic of growth of the set S and it is more sensitive than 



the asymptotic density p{S) (see Section 10.2) 



On the other hand, the function 7(5) is not even additive, which makes it 
difficult to use as a measuring tool. 

It would be very interesting to check whether sets having the same asymptotic 
densities (see, for example, |18|) will show different sizes with respect to a 

more sensitive and adequate measure. 



2. Conditions on a measure 

Let G be a group (finite or countable infinite) . A probabilistic measure fi on G 
has to satisfy the standard axioms of a probability space. 

Recall that a probability space is a set X together with a a-algebra of subsets 
A oi X and a probability measure fi : A — > R which is a real-valued function 
satisfying the following axioms: 

(Ml) For any set S C A, fi{S) ^ 0. 
(M2) fi{X) ^ 1. 

(M3) If e /, is a countable collection of pairwise disjoint sets from A, then 




Analysis of behaviour of randomized algorithms on groups requires estimation 
of the probability of a given element. A measure /i on X is atomic if it satisfies the 
following condition 

(M4) X is countable and every subset S C X is ^-measurable. 

In this paper, we shall consider mostly atomic measures on a free group Fn 
of rank n with basis {xi,. . . ,a;„}. Note that since every coset with respect to a 
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subgroup H < Fn is measurable, this defines an induced measure on the factor set 
Fn/H; this new induced measure is also atomic. 

A question that might arise (and which is usually addressed in the theory 
of growth of groups) is whether or not a measure should depend on a particular 
basis of Fn- Independence of a basis means invariance of the measure under the 
action of Auti^„. In the case of atomic measures, this implies that infinitely many 
singletons {g^}, h E Fn, have the same measure as {g} does, which implies fj,{g) — 0. 
Hence the only (Aut i^„)-invariant atomic measure on Fn is the singular measure 
concentrated at the origin 1 , and atomic measures necessarily depend on the choice 
of a basis in F„ . 

Finally, we record several informal conditions on a measure which will guide us 
in evaluating behaviour of various measures: 

(CI) a measure should be natural, i.e., it should meet our expectations of what 

the sizes of various sets in a group are. 
(C2) a measure should be sensitive, i.e., sets that (intuitively) seem to be of 

different sizes, should have different measures. 
(C3) niS) (or, at least, rather tight bounds for fJ,{S)) should be easily computable 

for "natural" sets S C Fn- 
(C4) a measure should admit a natural generator of random elements in the group. 

In the rest of the paper we consider various approaches to constructing measures 
on groups, checking them against our list of conditions (C1)-(C4). 



3. Atomic probability measures 

In this section, we discuss a general method which allows one to define atomic 
probabilistic measures on countable groups. Our definition of a measure is based on 
a notion of "complexity" of elements of G and a probability distribution on the set 
of all non-negative integers A/". These are the initial basic objects which determine 
the intrinsic behaviour of the corresponding measure. 

Depending on a problem at hand, one can use different types of complexities, 
for example, it can be a descriptive complexity, a computational complexity, the 
minimal length of the word representing an element with respect to a given gener- 
ating set, or the length of the normal form of an element. In general, a complexity 
function (or a complexity) on a group G is an arbitrary non-negative integral func- 
tion c : G — > Af such that for every n E N the preimage c~^(n) is a finite subset 
of G- Note that a group G with a complexity function is countable. At the end 
of the section we discuss more general complexity functions which allow elements 
of G to have infinite complexity and elements with real value complexities (in this 
case the group G can be uncountable). 

The most important example of a complexity function is the length function on 
a group. Let G be a finitely generated group with a given finite set of generators 
S C G- For an element 5 e G by Isig) we denote the minimal non-negative integer 
n such that 5 = yi . . . y„ for some yi £ S U - If is a subgroup of G then the 
restriction of ^5 on H gives rise to a new complexity function on H, which might 
not be a length function on H- 

In Section ^ we consider another important type of complexity functions on G, 
so-called Kolmogorov complexity functions. 
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For a given complexity function c on G and a discrete probability distribution 
on non- negative integers M , given by a density function d : M — > R, we are going 
to construct an atomic measure on G. 

Recall that, for an atomic measure on a countable set X, the value of fi on an 
arbitrary subset 5 C X is defined uniquely by the formula 

(3.1) M^) = ^mH. 

This shows that an atomic measure /x is completely determined by its values on 
singletons {w}, w € X.ln other words, to define /x it suffices to define a function 
p : X — > R (and put ii{w) = p{w)), which is called a probability mass function or 
a density function on X, such that: 

p{w) > for all X e X, 



E 



p{w) = 1. 



Now we put forward one more condition on the measure /x which ties /z. to a com- 
plexity function c : G ^ N: 

(C5) for any u,v G G, c{u) = c{v) implies = fJ.{v). 

We call a measure fi on G c-invariant if satisfies (C5), i.e., elements of the same 
c-complexity have the same measure. 

The following result describes c-invariant measures on G. For k & M define the 
fc-sphere Ck with respect to a complexity c as follows: 

Ck = {w&G\ c{w) = k}. 

Similarly, by B„ we denote the disc or the ball of radius n with respect to c: 

Bn = {w€G\ c{w) < n}. 

Lemma 3.1. Let fi be an atomic measure on G and c a complexity function on 
G. Then: 

(1) If fj, is c-invariant, then the function d^ : Af — > R defined by 

dfi : k — > At(C/c) 

is a probability measure on N ; 

(2) if d : Af — > R is a probability measure on Af, then the function pc,d '■ G — > 
TZ defined by 

d{c{w)) 

PcdH = jcTT 

is a probability function which gives rise to an atomic c-invariant measure 
on G. 



Proof is obvious. □ 

Remark 3.2. If c{w) is the length of an clement w G G with respect to a given 
finite set of generators of G, then c-invariant measures play an important role in 
asymptotic group theory. We will call such measures homogeneous. 

Since d{k) — > as fc — > oo, the following elementary but fundamental prop- 
erty of /ic,d holds. 
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More complex ( with respect to a given complexity c ) elements of a set 
S d X contribute less to ^J.c.d{S). 

This discussion shows that in defining the probabihty measures fic.d on G, 
everything boils down to the problem of choosing a complexity function c : G — > J\f 
and a moderating distribution d : J\f — > R. This choice might depend on a 
particular problem one would like to address. To that end, it seems reasonable to: 

(a) use complexity functions on G which reflect intrinsic features of the problem 
at hand; 

(b) use probability distributions on J\f which reflect the nature of the process 
which generates (pseudo)random elements of the group, or which allow to 
study the statistical properties of the group in a meaningful and computa- 
tionally feasible way. 

We discuss possible complexity functions on groups in the next section. Here 
are some well-established parametric families of density functions on Af that we 
have considered in this framework: 
(i) The Poisson density: 



kl 



dxik) = 

(ii) The exponential density: 

d^{k) = (1 - e--^)e-^'= 

(iii) The standard normal ( Gauss) density: 

da,a{k) = 6 • e 

(iv) The Cauchy density: 

dx{k)^b- 

(v) The Dirac density: 



(fc- A)2 + 1 



dm{k) 



1 if fc = TO 
if fc 7^ TO 

The following density function depends on a given complexity function c on G: 
(vi) The finite disc uniform density: 



dm{k) 




if fc < TO 
otherwise 



For example, the Cauchy density functions are very convenient for defining 
degrees of polynomial growth "on average" , while the exponential density functions 
are suitable for defining degrees of exponential growth "on average" (see Section ^. 

On the other hand, different distributions arise from different random genera- 
tors of elements in a group. For example, Dirac densities correspond to the uniform 
random generators on the spheres C„; finite disc densities arise from uniform ran- 
dom generators on the discs i3„. Measures on the free group Fn associated with 
the exponential moderating distribution on Af have especially nice properties and 
are studied in some detail in |Q. 
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Remark 3.3. Sometimes we allow for some elements in the set X to have 
infinite complexity oo. In this case we consider complexity functions of the type 
c : X — > N U {00} such that c~^{n) is a finite subset of X for every integer n (so 
that we may have infinitely many elements in X with complexity 00). Following 
our requirement that more complex elements contribute less to the measure, we 
define the probability density function p on X for elements with finite complexity, 
and for elements x G X oi infinite complexity we just put p{x) = 0. 



4. Kolmogorov complexity functions 

4.1. Kolmogorov complexity functions on free groups. In this section 
we discuss complexity functions on free groups. We are going to represent elements 
of free groups as reduced words in a given basis, so the starting point of this 
discussion is complexity of words. 

Let A be a finite alphabet and A* the set of all finite words in A. The length 
\w\ of a word w = xi . . . a;„, Xi € A, is equal to n. The length function I : w — > \w\ 
maps A* into TV. Observe that l^^{n) = It follows that 

• The length function / : A* — > A/" is a complexity function on A* . 

This is one of the basic complexity functions that we will consider in this paper. 
The problem however is that some very long words in A* do not look very complex. 
For example, the word a^" , where a Q A, has length 10^°°, but this word is very 
easy to describe. This leads to the notion of Kolmogorov complexity, or descriptive 
complexity, or sometimes it is called algorithmic complexity. We refer to |14 , | |25| 
for detailed treatment of Kolmogorov complexity. 

Intuitively, Kolmogorov complexity of a word w d A* is the minimum possible 
size (length) of a description of w with respect to a given formal general procedure. 
For example, one may think of the descriptive complexity of w as of the the mini- 
mum possible size of a program, in a given programming language, which produces 
w after finitely many steps. In this case, the word a^*^ above will have complexity 
much less than 10^"°. 

Now we give a formal definition of Kolmogorov complexity. 

It is sufficient to consider programs of a very particular type, say, Turing ma- 
chines. Denote hy B = {0, 1} the standard binary alphabet and by B* the set of 
all finite binary strings. Every Turing machine M determines a partial (perhaps 
empty) recursive function /m : B* — > A* . The function /m is defined on x G B* 
if and only if the machine M starts on the tape with a word x, halts in finitely 
many steps, and a word w G ^* is written on the tape. In this event, fM{x) — w. 
Denote by SMix) and tM{x), respectively, the tape space and the number of steps 
needed for M to write w and halt. 

The Kolmogorov complexity of a word w G A* with respect to a given machine 
M is defined as follows: 

, , f minlLTl \ x E B* Sz fM(x) = w} if such x exists 
Km[W) = < ' .T, ■ 

^ ' [00 otherwise 

Similarly, one can define time bounded and space bounded Kolmogorov com- 
plexities of a word w E A* . Namely, for a Turing machine M and a function 
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(a bound) define the space bounded and time bounded Kolniogorov complexities 
KSm(x,I3) and KTm{x, (3) (with respect to the bound (3) as follows: 



KSm{w,I3) - 



KTm{w,13) = 



min{|a;| | x G -B* & fi{x) — w &l sm{x) ^ if such x exists 

oo otherwise 

min{|a;| \ x £ B* k, fi{x) — w k, tM{x) ^ /3(|a;|)} if such x exists 

oo otherwise 



These definitions depend on a given Turing machine M. It turns out that a 
universal Turing machine M provides an optimal notion of Kolmogorov complexity. 
Namely, the following invariance theorem holds. 



Invariance Theorem I ([21| |13], [^). There exists a Turing machine U such 
that for any Turing machine M and for any word w € A* the following inequality 
holds: 

Kuiw) ^Km{w) + Cm, 
where Cm is a constant which does not depend on w. 

Similar results hold for space bounded and time bounded Kolmogorov com- 
plexities. 



Invariance Theorem II [11]. There exists a Turing machine U such that for any 
Turing machine M , for any bound (3 : J\f — > M, and for any word w £ A* the 
following inequalities hold: 

KSuiw, CMfi) ^ Km{w, /?) + Cm, 
KTu{w, CMl3\og{l3) ^ Km{w,P) + Cm, 
where Cm is a constant which does not depend on w. 

The theorems above allow one to consider Kjj{w) as the Kolmogorov complexity 
K(w) of a word w G A* , were U is an arbitrary fixed optimal machine. 

We will use these different types of Kolmogorov complexities in constructing 
measures on a free group. But first, two remarks are in order (a bad news and a 
good news). 

Remark 4.1. The function w — > K{w) is not recursive. Thus, we cannot 
effectively compute the Kolmogorov complexity of a given word. However, it turns 
out that we can estimate K{w) by the length of w. This shows that the length \w\ 
as a complexity of w is back in the game. 



Remark 4.2. There exists a constant C such that for any word w £ A* 

K{w) ^ \w\\ogM\+C. 

Actually, there are better estimates which will be discussed elsewhere. 

Now we define Kolmogorov complexity of an element of a free group F — F{X) 
with a basis X. Let = {x"^ \ x £ X} and A = X VJ X~^ . For an element 

f € F denote hy wj the unique reduced word in the alphabet A which represents 
/• 

Now the Kolmogorov complexity K{f, X) of /, with respect to the basis X, is 
defined as 

K{f)^Kiwf). 
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It is easy to see that for different bases X and Y of F, the corresponding 
Kolmogorov complexities are equivalent up to some additive constants, i.e., there 
are positive integers Ci and C2 such that for any f E F, 

K{f, y) - Ci ^ K{f, X) ^ K{f, Y) + C2. 

This allows us to fix an arbitrary basis of the free group F and consider all the 
Kolmogorov complexities with respect to this particular basis. 

5. Kolmogorov complexity functions on finitely generated groups 

Let G be a group generated by a finite set X. There are several ways of 
introducing Kolmogorov complexity on G. 

Method I. Let F{X) be a free group on X and let r? : F{X) — ^ G he the 
canonical epimorphism. For an element g G G define the Kolmogorov complexity 
K{g, X) of g with respect to X as follows 

K{g, X) = mm{K{w, X) \ r]{w) = g}, 

i.e., the Kolmogorov complexity oi g G G is the minimum of Kolmogorov complex- 
ities of representatives of g with respect to X. 

As we have already seen, K{g, X) can be estimated from above by the geodesic 
length of g in the Cayley graph of G with respect to the set X of generators. It is 
extremely difficult to deal with this type of complexity if the set of such geodesies is 
itself very complex. In this event, it might be useful to consider some special normal 
forms of elements. For example, this is the case for braid groups with respect to 
the set of Artin generators. 

Method II. Suppose the group G has a set of normal forms V, i.e., there exists 
a subset V C F such that 77 jy is one-to-one. Then one can define Kolmogorov 
complexity oi g G G with respect to V as 

Kig,V) = Kir'Wig)). 

The following method provides an average Kolmogorov complexity with respect 
to a given measure on F. 

Method III. Let 77 : F — > G be as above. Let d : M — > i? be a probability on 
N and c : F — > M a Kolmogorov complexity function on i^. As we have discussed 
above, there exists a probability measure = /ic,d on the group F. Define an 
average Kolmogorov complexity of g G G with respect to d, c, and X as follows: 

j..4(M,o.x)= ^y--''''''°'7';' . 

6. Short elements bias and behaviour at infinity 

In this section we discuss how to deal with short elements bias in an infinite 
group. 

Let G be a finitely generated group with a finite sot X of generators. For 
simplicity we will discuss only measures corresponding to the length function Ix 
on G, but similar arguments can be applied for other complexity functions as well. 
Let 

Cn = {geG\ Ixig) - n}, S„ = {g e G | lx{g) ^ n} 
Let jj be an atomic function on G. Since ^lJ-{Ck) = 1, we have ii{Ck) — > 
as k — > 00. Therefore elements of bigger length in a set 5 C G contribute less 
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to fJ,{S), i.e., we witness the short elements bias. On one hand, this meets our 
intuitive expectations because, technicaUy, random elements of a very big length 
are inaccessible to us, e.g., no computer can generate for us a random element of 
length > 10^"°. On the other hand, it may happen that only a few short elements 
essentially define the measure of an infinite set. 

There are several approaches to deal with the short elements bias. 

Method I. 

In practical computations with groups, when we wish to evaluate performance 
of a given algorithm A, a typical solution to this problem is the following. Choose 
a sufficiently big random positive integer n (or several of them), generate (pseudo) 
randomly and uniformly enough elements of length n, and run your algorithm on 
the produced inputs. The choice of n usually depends on the computer resources 
and the hardness of the algorithm. Mathematically this can be modelled by the 
probability distribution /ii^.d,, with Dirac density dn- The only problem is the 
choice of n. Theoretically, to avoid the bias toward short elements, one has to take 
the hmit when n ^ oo. More precisely, let i? be a subset of G. Denote by Pn{R) 
the measure of R with respect to Dirac density concentrating at n: 

Pn W i^^i • 

Then the asymptotic behaviour of the set R (with respect to Dirac densities) can 
be characterized by the following limit (if it exists): 

P^^HR)= hm p(f)(i?) 

n — >QC 

This limit is called the shperical asymptotic density of the set R. 

Similarly, if we measure R with respect to the disc uniform density function 
(with respect to the complexity function Ix)' 



\Bu\ ' 

and then take the limit (if it exists) 

p(^\R) = lim pW(i?) 

n — ^oo 

then we get the disc asymptotic density of the set R. 



In Section 10 we discuss asymptotic densities in detail, here it is worthwhile 
to mention only that these characteristics are not sensitive enough to distinguish 
various subsets of groups. For example, all subgroups of infinite index have the 
same asymptotic density (disc or spherical) equal to 0. 

Method II. 

Let /i be a probability distribution on G. Then the mean length L^^x of ele- 
ments of G with respect to p and the set X of generators is the expected value of 
the length function Ix- 

L^i.x = ^ lx{g)K9)- 

For example, if G = F{X) and a measure fj,\ = pix.dx given on F{X) by the 
exponential density: 

dA(fc)-(l-e-^)e-^'= 
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then the mean length L\ of words in F distributed according to nx is equal to 

oo 

iA- 5]klMH = (l-e-^)^fc(e 



toeF k=l 



fe=i ^ ^ 

Hence L\ ^ oo when A ^ 0. Thus, we have 

a family of probabilistic distributions {/xa}a with parameter A such 
that the mean length L\ tends to oo when the parameter A approaches 
0. 

Similar results hold for all other distributions introduced in Section ^ (after 
renormalizing the parameters). 

Now one can measure the behaviour of R at infinity by the following limit (if 
it exists) 

/^oo(^) = lim Ma(^)- 

A — *0 

A study of limits of this type was initiated in |^ . 

Note that if {iik}k is the Dirac parametric family of distributions, then 
and if {fik}k is the parametric family of disc uniform distributions, then 

7. Degrees of polynomial growth "on average" 

Let c : Fn — > R be a complexity function and fj, the measure moderated by 
the Cauchy distribution 

The advantage of this distribution is that it allows to measure degrees of polynomial 
growth of functions on Fn "on average" in the following sense. 

Let / : Fn — > R be a non-negative real valued function. We say that / has 
polynomial growth of degree a ^ if a is the greatest lower bound of the set of 
real positive numbers /3 such that the mean value of ^(^{^y}-i is finite, that is, 

f „ f(,iA 
(7.1) a = inf <^ (3 



I Nfl^i A*c,d(w converges 
weF„ ^ ' 



It follows immediately from the construction of our measure that the function c(w)"^ 
has growth of degree m. (Just recall that the series J2 ;riT7 converges for all e > 0, 
while the harmonic series J2 n diverges.) 

In particular, if /(x) is the running time of some algorithm with input x, this 
definition allows us to make meaningful statements like "the algorithm works in 
cubic time on average" . 

Now we are going to generalize this situation and try to find sufficient conditions 
to define the degrees of growth of functions on an arbitrary infinite factor group of 

Fn. 
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Let G = Fn/R be an infinite factor group of the free group F„ and r] : F„ — > G 
the natural homomorphism. Wc shall list some natural conditions for the complex- 
ity function c on F„ and the moderating probability distribution d which allow us 
to define degrees of the average growth of functions on G. 

(Dl) {Existence of a complexity function on G.) The mean complexity 

c 5) = ^F=r^ 

is finite for every g G G. 
(D2) {Existence of degrees.) There exists a positive number r such that the series 

geG 

converges for a < r and diverges for a > t; p. here is the measure on G 
induced by the measure /ic,d on F„. 

If these conditions arc satisfied, wc can define the degree of growth of an arbitrary 
nonnegative real valued function on G as 




Now we at least have the (easy to check) property that the degree of growth of the 
function c{g)™ is m. 

Note that not every moderating probability distribution d on N U{0} is suitable 
for defining degrees of polynomial growth. For example, if we take in the definition 
of the degrees of growth on the free group F„, the exponential distribution 

d{k) = (1 - e-^) • e-^^ 

we note that the series 

^ c{xy iJ.c,d{^) 
xeF„ 

converges for all a. 

Question 7.1. Can one find a moderating probability distribution d such that 
the conditions Dl and D2 are satisfied for every factor group G = Fn/R of infinite 
index and the measure p, on G induced from fic,d ? 

The Cauchy distribution d{k) = ^ • p- is still on the list of candidates for the 
affirmative answer. 

Wc need to warn that the degree of growth of a function / on G is not necessarily 
equal to the degree of growth of its lift f o rj : Fn — > M. It might happen that, 
for certain problems, it is much more convenient to work in a free group than in 
its factor group G and evaluate the degrees of growth of the lifted function f o rj 
instead of those of /. Therefore general results concerning relations between degrees 
of growth on F„ and G = Fn/R would be rather interesting. 
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8. Measures generated by random walks 

8.1. Random walks. An interesting and, in some cases, easier to analyze 
class of measures is related to random walks on Cay ley graphs of groups. From an 
algorithmic point of view, the most natural way to produce a random element in 
a finitely generated group is to first make a random word on the generators, and 
then apply relations. This is, in disguise, a random walk on the Cayley graph of 
a given group. This becomes especially relevant when hardware random numbers 
generators are used to produce random words. 

Let us look at this basic procedure in more detail. 

Let Fn be a free group on free generators Xi, . . . , a:„. We can associate with it a 
free monoid M„ with the generators Xi, Xi, ... , a;„, X„ and the natural homomor- 
phism TT : M„ — > Fn which sends Xi to Xi and Xi to x~^ . A random walk of length 
I on the Cayley tree of Fk which starts at the identity 1 is naturally described by 
a word of length I in M„. 

In essence, we take M„, not Fn, as the ambient algebraic structure, and in- 
troduce measures and complexity functions on M„ rather than on F„. There is 
a compelling evidence that, in at least some problems, it might be convenient to 
work in the ambient free monoid M„. For example, it appears that physicists prefer 
to use the random walk approach in their study of statistics of braids, knots and 
heaps, the latter being closely related to locally free groups 

CTn - (/i, ...Jul [h, fj] = 1 for 1* - il > 1) 
(see e.g. |^ |^ ^, 23]). In any case, the physical process of the accumulation of 



soot (in the 2-dimcnsional case) is most naturally modeled by random walks on the 
monoid of positive words in CTn- 

Let d : N U {0} — > R be a moderating probability distribution. Following the 
analogy with our constructions for a free group, we introduce an atomic probability 
measure jl on M„ by assigning equal probabilities to words of equal complexity and 
the total measure d{k) to the set of words of complexity k. This can be interpreted 
as running random walks on Fn of random complexities k distributed with the 
probability density d{k). The probability for a random walk to stop at the element 
X G Fk defines a measure fj,{x) on Fk. Obviously, 

tt{'w)—x 

8.2. How is this new measure related to Kolmogorov complexity 
measures? The measure that we have just defined should have properties very 
close to Kolmogorov complexity measures described in Section ^ The word x = 
t:{w) is obtained from a word w by cancelling all adjacent pairs of elements of the 
form XiXi, which amounts to running a certain Turing machine on the input word 
w. Therefore, in view of the previous discussion, the Kolmogorov complexity c{x) 
of X differs from the Kolmogorov complexity c{w) of w by at most an additive 
constant: 

c{x) < c{w) + C. 

Since for most words of a fixed big length I, Kolmogorov complexity is close to 
the length of the word, we should expect roughly the same asymptotic behaviour 
from a random walk measure as we do from a measure associated with Kolmogorov 
complexity. 
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8.3. Comparing the length of a random walk with the geodesic length. 

In a simple case, where the complexity function on M„ is just the usual length func- 
tion l{w), we can similarly compare its behaviour with the behaviour of the geodesic 
length I — lgeod{x) of the end point x = tt{w) of the walk on the Cayley tree r(F„) 
of Fn, described by the word w. 

It is easy to see that the function x i— > Zgcod(a;) maps a random walk on r(_F'„) to 
a non-symmetric random walk yV+ on the set N U {0} of nonnegative integers with 
reflection at 0. In this new walk, we make steps of length 1; we move from the point 
I = to the right with probability 1, and, from any other point / ^ 0, we move to 
the right with the probability p = and to the left with the probability 9 = ^ • 
Obviously, the mean value of I is at least the mean value of / for a random walk W 
on Z without reflection at 0; in this walk we start at and move by 1 to the right 
with the probability p and by 1 to the left with the probability q. In W, we make 
TO moves to the right with the probability (^^) • p™ • q'^-"^ and end up at the point 
I' = m — (k — m) = 2m — k. Thus, the random variable I' is a linear transformation 
of the random variable to distributed according to the binomial distribution. Since 
the expectation E{m) = pk, we deduce that E{1') = 2pk — k = ^^^fc- 

Therefore, for the random walk W+, the expected value of I is bounded from 
below by ^k = E{1') E(l). The upper bound Eil) ^ k \s obvious. Thus, the 
expected geodesic length E(lgf,od{x)) of an element x G Fn produced by a random 
walk of length k is estimated as 

n~ I 

k < E{lf,cod[x)) ^ k. 



9. Behaviour of the induced measures on finite factor groups 

In this section we try to establish possible criteria to evaluate how "natural" a 
given measure is, i.e., how it matches our expectations of what the probability of 
hitting particular sets should be. In general, these expectations may, of course, vary 
from individual to individual, but we have some common grounds, at least, in the 
case of finite sets. So, our idea is to set up some tests to compare a given measure 
/i on a free group F = Fn with the induced measures on finite quotients of F . 
Most people will probably agree that, for example, the probability for a randomly 
chosen element of F to have even length should be about 1/2. More generally, it is 
natural to expect that a subgroup _ff < of an index m would have the measure 
(approximately) equal to ^. Unfortunately, this is not the case if /i(l) > 0, indeed, 
if » RT) then /i(i/) > » ^. 

Moreover, most people working with probabilities on finite groups are likely to 
prefer measures which are well behaved with respect to taking "big" finite factors 
Fn/ R, for example, measures which yield reasonable bounds for the total variance 
distance 

(9.1) \ E 

of the induced distribution from the uniform distribution on Fn/ R. Again, this 
natural condition cannot be satisfied basically for the same reason, namely, the 
value of ( ^j| ) is bounded from below by ^|/i(l) — (l/m)|, where m = |F„ : and 
does not converge to as to — > oo. 



^'^-W-R\ 
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This is a typical example of an unexcusable dependance of a measure on short 
elements. This is one of the many reasons to believe in the following (maybe 
controversial) metamathematical thesis: 

There is no particular atomic measure on an infinite group which would 
meet our expectations of sizes of particular sets in the group 

A possible practical outcome of this thesis is cither to consider a whole family 
of parametric measures on a group instead of a fixed one, or to adjust our criteria 
to allow a margin of an error of approximation. The first approach was developed 
to some extent in , here we make an attempt to consider the second one. 

One may wish to exclude anomalously big probabilities of short elements by 
taking the measure of an element gR in a finite factor group Fn/R to be the 
renormalized measure of the set of "large" elements in gR: 



(9.2) 



/i((f„ X Bi) n gR) 
KFn \ B,) 



where Bi is the ball of radius I centered at 1. Anyway, it is only natural to assume 
that, when assessing the average case complexity of algorithmic problems of prac- 
tical interest, we are working with words of sufficiently large size; the bias towards 
short elements can be safely ignored. A measure /i can be accepted as "good" if, 
for values of I much smaller than m, 



(9.3) 



2 ^ 

geF„/R 



1 



|F„ : R\ 



1 

< 

e 



or is bounded by some other reasonable constant, or decreases exponentially with 
the growth of I: 



(9.4) 



- E 

2 



W(5) - 



1 



Fn : R\ 



o(e-^') 



The reader probably feels already at this point that we are leaning towards the 
classical concept of a random walk on a group. Taking images of "sufficiently 
long" elements means allowing a sufficiently long random walk on the finite group 
Fn/R. It is reasonable to take the mixing time of the random walk on the factor 
group Fn/R with respect to t he generators xiR, . . . ,a;„i? for the characteristic 
word length I in the criterion (9^). Recall that the mixing time is defined as the 
minimum such that 



(9.5) 



|P/„ - U\\ = 



1 



E 



Pioia) 



\Fn : R\ 



< 



1 



where Pi (g) is the probability for a random walk of length I to end up at the element 
g, and U is the uniform distribution on Fn/R. From the mixing time on, a random 
walk distribution converges to the uniform distribution exponentially fast: 



(9.6) 



\Pkio-u\\ = \ Yl 



geF„/R 



Pkioig) 



1 



\Fn : R\ 



Therefore, we can consider an alternative criterion for good behaviour of the 
measure on finite factor groups: 
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(CI*) For every finite factor group G — Fn/R there exists a number such that 

1 



F,, : R\ 



Of course, the value of /q ("the mixing time") is all important. One should 
expect from a 'good' measure on Fn that it is approximately the same as the 
mixing time of a random walk on F„/R. It is worth mentioning here, that, by a 



result by Pak [19|, a random walk on a finite group G with respect to a random 
generating set of size 0(log |G|) mixes under 0(log \G\) steps. 

It is relatively easy to see that our criterion (CI*) is met by a wide class of 
measures associated with the length function on Fn. 

Theorem 9.1. // the moderating distribution d{k) satisfies a rather natural 
condition d{k) > for infinitely may values of k then, for every finite factor group 
G — Fn/R, the induced measure fti on G satisfies the condition (CI*). 



9.1. Proof of Theorem 9.1. Let \G\ = m. Every probability distribution 
on G is a vector pi, . . . ,Pm of non-negative real numbers subject to the condition 
Pi + ■ ■ ■ + Pm — 1- Thus, the set of all probability distributions on G is the convex 
hull A of the distributions Eg concentrated at g E G, i.e., Eg{h) = 1 \i h = g and 
otherwise. We introduce on A the total variance metric 



2 

geG 

A step of a random walk induces an affine transformation r : M'" — > K™ by the 
rule 

h adjacent to g 

where the adjacency is considered in the Cayley graph of G. 

Since all vertices of the Cayley graph of G have the same degree, r fixes the 
uniform distribution U = (l/m, . . . , 1/m). By somewhat abusing the notation, we 
can move the origin of the coordinate system in K.™ to the point U. It will be 
convenient to restrict the action of t to the subspace R'"^^ spanned in M™ by the 
polytope A. Then the distance — [/|| becomes a norm on R'"^^ (which we denote 
by \\X\\; hopefully, this will not cause a confusion), and r becomes a linear operator 
on M™^-'^. Note that the norms ||r'^(£'g)|| of vertices of the convex polytope t'^(A) 
are all equal. Since t maps the vertices Eg of A to points of A, it does the same 
to the vertices of r'^(A). As a result, we have the following monotonicity property: 

\\T''+\Eg)\\^\\r^{Eg)\\. 

We can use again the property that t maps the vertices Eg of the convex 
polytope A to prove that ||r(i?g)||£; < lli^glls for all g & G, where |j \\e stands for 
the standard Euclidean norm on K™"^. It follows that 

11^11^= niax ^^^<1, 
" xeA,x^o \\X\\e 

hence |jr'^(X)||£; — > as fc — > oo for every distribution X. Moreover, |jT'^(X)|j£; 
decreases exponentially: ||t''(X)||£; = o{c''). Since any two norms on a finite di- 
mensional space are equivalent, we have 

^\\X\\e<\\X\\<C\\X\\e 



MEASURING SETS IN INFINITE GROUPS 



17 



for some constant C, and it follows that ||t'''(X)|| — o{c^). 
We have, in our new notation, 



geG 
1 



1 



1 



Y.d{k)T\E,) 



k>l 



1 



Y.dik)r\E,) 



\r'{Ei)\\ 



Ek>idik) 



\r\E,)\\ 



We see that the probabilistic distribution on G produced by a random walk of 
random length ^ / converges to the uniform distribution as fast as the probability 
distribution after I steps of the usual random walk does. In particular, this proves 
that the measure fii on G satisfies the condition (CI*). 



10. The growth function and asymptotic density 

In this section, we discuss an approach to analyze the asymptotic behaviour of 
sets in a group via asymptotic densities introduced in Section ^. 

Let G be a group with a complexity function c : G M (for example, a group 
generated by a finite set X with the length function Ix)- 

A pseudo-measure on G is a real- valued nonnegative additive function defined 
on some subsets of G. The pseudo-measure /i is called atomic if fJ-{S) is defined for 
any finite subset S of G. Let Gk and Bf, be, respectively, the sphere and the ball 
of radius A; in G with respect to the complexity c. 

For a set R C F, we define its spherical asymptotic density with respect to fi 
as the following limit (if it exists): 



p(«)(i?)= hm priR), 



is). 



where 



A^(i?nGfe) 



/i(Cfc) 

Similarly, we define disc asymptotic density of R as the limit (if it exists) : 



(i?)= hm pi'\R), 

k — '■oo 



where 



p{Bk) 

One can also define the density functions above using limsup rather then lim. 

For example, if /i is the cardinality function, i.e., p.{A) = \A\, then we obtain 
the standard asymptotic density functions p*^^^ and p'^'^'^ on G. 

Moreover, if p is c-invariant, i.e., p{u) = p{v) provided c{u) — c{v), then the 
spherical asymptotic density with respect to p is equal to the standard spherical 
asymptotic density: 
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Lemma 10.1. For any atomic pseudo-measure fi on G, the asymptotic densities 
p^'^' and p^'^^ are also atomic pseudo-measures on G. 

Proof is obvious. 

Lemma 10.2. Let p, be a pseudo-measure on G. Suppose that 

lim i-i{Bk) — oo. 

k — *oo 

Then, for any subset R Q G, if the spherical asymptotic density p]^^ (i?) exists, then 
the disc asymptotic density p^if \R) exists, too, and 

P^^\R) = pi'\R). 

Proof. Let xu = p{R n Bk) and yk = p{Bk). Then yu < yu+i and limj/fc = oo. 
By Stolz's theorem 

1. 1. •^k ^k—1 

lim — — lim . 

fe^oo j/fc fc^oo yk — yk-i 

Hence 

pI^HR) = lun l^i^^ ^ pi;HRl 

as claimed. 

In view of this result we will refer to the standard (spherical or disc) densities 

P- 

Asymptotic densities provide a useful, though very coarse, tool to describe 
behaviour of sets at infinity. Furthermore, there are very natural subsets which are 
sadly unmeasurable with respect to p. To see this, consider the set En of words of 
even length in a free group Fn of rank n. Then p{En) is easily seen to be undefined. 
A way around this problem (involving generalized summation methods for series) 
is discussed in |^ . 

Moreover, the following well-known result shows that that p is not sufficiently 
sensitive. 



Theorem 10.3. (Woess ||26[) If N is a normal subgroup of Fn, n ^ 2, of 
infinite index, then p{N) — 0. 

Another disappointment is offered by the following result. As usual, we call 
an element u G Fn primitive if it is part of a free basis of Fn, or, equivalently, if 
a{u) = x\ for some a £ Aut(Fn\ 

Theorem 10.4. Let Fn be the free group of a finite rank n ^ 2. Then: 

p(Prn) = 0, 

where Pr„ is the set of all primitive elements of the group Fn ■ More precisely, if 
P{n, k) is the number of primitive elements of length k in Fn, n > S, then for some 
constants ci, C2, one has 

ci • (2n - 3)'' < P{n, fc) < ca • (2n - 2)'=. 



We see that the asymptotic density p is not sensitive enough in measuring sets 
in Fn ■ It would be very interesting therefore to check whether probabilistic results 
0], I IS], that are based on the asymptotic density p, will hold upon 



of. 



say. 



replacing p with a more adequate measuring tool. 
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10.1. Proof of Theorem 10.4, Our proof is based on the fact that the 
Whitehead graph of any primitive element of length > 2 has either an isolated 
edge or a cut vertex, i.e., a vertex that, having been removed from the graph to- 
gether with all incident edges, increases the number of connected components of 
the graph. Recall that the Whitehead graph Wh{u) of a word u is obtained as 
follows. The vertices of this graph correspond to the elements of the free generating 
set X and their inverses. If the word u has a subword XiXj, then there is an edge 
in Wh(u) that connects the vertex Xi to the vertex a;~^; if u has a subword Xixj^, 
then there is an edge that connects Xi to xj, etc. We note that usually, there is one 
more edge (the external edge) included in the definition of the Whitehead graph: 
this is the edge that connects the vertex corresponding to the last letter of u, to 
the vertex corresponding to the inverse of the first letter. 

Assume first that the Whitehead graph of u has a cut vertex. We are going 
to show that the number of elements u of length k with this property is no bigger 
than C ■ (2n — 2)*^"^, where C = C{n) is a constant. 

Let Wh(u) be a disjoint union of two graphs, Fi and complemented by 
a (cut) vertex A together with all incident edges. Let ni ^ 1 and 712 ^ 1 be the 
number of vertices in Fi and respectively. Then, in particular, ni +712 = 2n — 1. 
Let TO = min(ni,n2). 

The first letter of u can be any of the 2n possible ones. For the following letter 
however we have no more than 2n — m possibilities since, for example, if the first 
letter, call it Xi, corresponds to a vertex from Fi, then the following letter, call it 
Xj, cannot be such that the vertex corresponding to xj^ belongs to F2, because 
otherwise, there would be an edge in Wh{u) that connects Fi to F2 directly, not 
through A, i.e., A would not be a cut vertex. 

Thus, there are only 2n — 712 ^ 2n — m possibilities for the following letter in 
u. The same argument applies to every letter in u, starting with the second one; 
therefore, the total number of possibilities (corresponding to a particular choice of 
ni and rt2) is no bigger than C ■ {2n — m)*^"^, where C — C{n) is a constant. 

Now consider two cases: 

(i) m ^ 2. 

In this case, the number in question is bounded by C ■ {2n — 2)*^"^. 
(ii) m — 1. 

In this case, one of the graphs, say, r2, consists of a single vertex (call it xi for 
notational convenience) connected only to the cut vertex (call it 0:2); in particular, 
the vertex Xi is a terminal vertex of the graph Wh{u), and, whenever the letter xi 
occurs in u, it is followed by a;^^. Let g ^ 1 be the number of occurrences of xi in 
u. 

Apply the automorphism (jj : xi t—^ xiX2, Xi ^ Xi, i ^ 2, to the element u. 
Then ](/'(«) | — \u\ — q < \u\. Now (j){u) is a primitive element, too; hence the whole 
argument above is applicable to 4'{u). In particular, if, in the notation above, m ^ 2 
for this element (piu), then, as we have just proved, the number of those elements 
is bounded by C ■ (2n — 2)'^~^~' for some C = C{n). This number is also equal to 
the number of elements u of the type we are considering now (for a particular q) 
because of the one-to-one correspondence between elements u and 4>{u). 

If TO = 1 for the element (j){u), then we use the same trick again, until we 
get to a primitive element with to ^ 2. In any case, the number of primitive 
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elements of length k, with m = 1, and with q ^ 1 occurrences of xi, is bounded by 
C • (2n - 2)'=-i-9 for some C = C(n). 

Now we have to sum up for all possible values of q. Note that q cannot be equal 
to k since Xi is not a primitive element; also, q cannot be equal to fc — 1 since in 
the Whitehead graph of x'l^^X2, the vertex corresponding to xi is not a terminal 
vertex. Hence, we have 

J2 C • (2n - 2)'=-!-'' = C ■ ^^"^ ~ ^ ^ C • (2n - 2)'''\ 

q=l " 

Finally, we have to multiply this number by the number of ways we can choose 
two vertices (terminal and cut) out of 2n, i.e., by (2n — l)n, but this does not 
change the type of the bound. Therefore, we get here the same bound as we got in 
the case to ^ 2. 

Thus, summing up for all possible values of ni ^ 1 and n2 ^ 1 such that 
ni + n2 = 2n — 1, we see that the total number of possibilities in (i) and (ii) is 
bounded by C ■ {2n — 2)^ for some C = C{n). 

Finally, we address the remaining case, where the Whitehead graph of u has 
an isolated edge. In that case, some cyclic permutation of u must be of the form 
xf^^x^^ui, where ui does not depend on Xi, xj, and j does not have to be different 
from i. The number of elements with this property is easily seen to be bounded by 
C ■ k ■ {2n — 3)*^"^ for some constant C ~ C{n). 

Therefore, the ratio of the number of primitive elements of length k to the num- 
ber of all elements of length k is no more than C ■ 2Ti(2n-ij'=-i ~ ' ( 2n-i where 
C — C'{n) is a constant. This ratio obviously tends to 0, and, moreover, well- 
known properties of a geometric series now imply that the ratio of the number of 
primitive elements of length ^ k to the number of all elements of length ^ k tends 
to 0, too. 

Finally, we note that the lower bound ci • (2n — S)'^ < P{n, k) is obvious because 
every element of the form xi ■ u{x2, is primitive. 

Just to complete the picture, we also mention here the bounds for the number 
of primitive elements of length k in ^2^ 



Proposition 10.5. (|17|) The number of primitive elements of length k in the 
group F2 is: 

(a) more than ■ (V3)'° */ k is odd. 

(b) more than | • (-v/S)*^ */ k is even. 

Informally speaking, "most" primitive elements in F2 are conjugates of primi- 
tive elements of smaller length. This is not the case in F„ for n > 2, where "most" 
primitive elements are of the form u ■ x^^ ■ v, where u, v are arbitrary elements that 
do not depend on Xi. 

10.2. The rate of convergence of the asymptotic density. Let 5 be a 

subset of a free group Fn of rank n ^ 2. As we have mentioned in the Introduction, 
the growth rate of the set S is defined as the limit 



j{S) = limsup {/ pk{S), 

k—*-oo 
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where 

'"^^^ - -wr 

are the disc relative frequencies of S. The growth rate 7(5") is gauging the speed 
of convergence to zero of the sequence pk{S). Indeed, the standard results from 
calculus show that if 7(6') < 1, then p{S) = 0, and, moreover, the sequence Pk{S) 
converges to exponentially fast. 

A classical theorem by Grigorchuk |^] states that if < is a normal sub- 
group and Fn/N is not an amenable group, then 7(A^) < 1. This gives us many 
easy examples of normal subgroups N with the exponential speed of convergence 
of the asymptotic density. For example, this happens every time when the factor 
group Fn/N contains a non-abelian free group. 

It would be interesting to have a look at the other end of the spectrum and 
estimate the speed of convergence of the sequence pk{N) for an obviously "big" 
subgroup N < Fn- The following result is one of the very few instances where we 
have concrete information: 



Theorem 10.6. (Sharp [|20|) Ifn^2, then the spherical relative frequencies 

is) _ \[Fn,Fn]nSk\ 
Pk — 



\Sk\ 

of words of length k from the derived subgroup [Fn, Fn] of the free group Fn asymp- 
totically behave as 

(s) / if k is odd, 



'i^j2 ^ e.ven. 
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